hallucinatory gan
Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs
Reinforcement Learning (RL) algorithms typically require millions of environment interactions to learn successful policies in sparse reward settings. Hindsight Experience Replay (HER) was introduced as a technique to increase sample efficiency by reimagining unsuccessful trajectories as successful ones by altering the originally intended goals. However, it cannot be directly applied to visual environments where goal states are often characterized by the presence of distinct visual features. In this work, we show how visual trajectories can be hallucinated to appear successful by altering agent observations using a generative model trained on relatively few snapshots of the goal. We then use this model in combination with HER to train RL agents in visual settings. We validate our approach on 3D navigation tasks and a simulated robotics application and show marked improvement over baselines derived from previous work.
Reviews: Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs
Originality: The authors make clear the distinction from related work. They are not the first to integrate GANs for generating goals in RL, but do so in a new and interesting way. Quality: The comparison with baselines is thorough, showing the benefit of this approach for these domains. However, page 7 claims that HALGAN RL agents need fewer samples than standard RL, and yet in fact HALGAN must be exposed to enough samples of successful trajectories to be able to effectively hallucinate goal states. Are the 1000-samples used to train the HALGAN shown in Figure 3(f) 1000 examples of /goal/ states, or just states in general.
Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs
Reinforcement Learning (RL) algorithms typically require millions of environment interactions to learn successful policies in sparse reward settings. Hindsight Experience Replay (HER) was introduced as a technique to increase sample efficiency by reimagining unsuccessful trajectories as successful ones by altering the originally intended goals. However, it cannot be directly applied to visual environments where goal states are often characterized by the presence of distinct visual features. In this work, we show how visual trajectories can be hallucinated to appear successful by altering agent observations using a generative model trained on relatively few snapshots of the goal. We then use this model in combination with HER to train RL agents in visual settings.
Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs
Sahni, Himanshu, Buckley, Toby, Abbeel, Pieter, Kuzovkin, Ilya
Reinforcement Learning (RL) algorithms typically require millions of environment interactions to learn successful policies in sparse reward settings. Hindsight Experience Replay (HER) was introduced as a technique to increase sample efficiency by reimagining unsuccessful trajectories as successful ones by altering the originally intended goals. However, it cannot be directly applied to visual environments where goal states are often characterized by the presence of distinct visual features. In this work, we show how visual trajectories can be hallucinated to appear successful by altering agent observations using a generative model trained on relatively few snapshots of the goal. We then use this model in combination with HER to train RL agents in visual settings. We validate our approach on 3D navigation tasks and a simulated robotics application and show marked improvement over baselines derived from previous work.